Sparse random graphs: Eigenvalues and Eigenvectors 



Linh V. Tran, Van H. Vu* and Ke Wang 
Department of Mathematics, Rutgers, Piscataway, NJ 08854 



Abstract 

In this paper we prove the semi-circular law for the eigenvalues of the random regular graph $G_{n,d}$ in the case
$d \to \infty$, complementing a previous result of McKay for fixed $d$. We also obtain an upper bound on the infinity norm
of eigenvectors of the Erdős-Rényi random graph $G(n,p)$, answering a question raised by Dekel-Lee-Linial.



1 Introduction 

1.1 Overview 

In this paper, we consider two models of random graphs, the Erdős-Rényi random graph $G(n,p)$
and the random regular graph $G_{n,d}$. Given a real number $p = p(n)$, $0 < p < 1$, the Erdős-Rényi
graph on a vertex set of size $n$ is obtained by drawing an edge between each pair of vertices,
randomly and independently, with probability $p$. On the other hand, $G_{n,d}$, where $d = d(n)$
denotes the degree, is a random graph chosen uniformly from the set of all simple $d$-regular
graphs on $n$ vertices. These are basic models in the theory of random graphs. For further
information, we refer the readers to the excellent monographs [4], [19] and survey [53].
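The definition of $G(n,p)$ above can be sketched in a few lines of code. The following is a minimal illustration using only the Python standard library; the function names `sample_gnp_adjacency` and `degrees` are our own and do not come from the paper.

```python
import random


def sample_gnp_adjacency(n, p, seed=None):
    """Return the 0/1 adjacency matrix (list of lists) of one sample of G(n, p)."""
    rng = random.Random(seed)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            # Each pair {i, j} becomes an edge independently with probability p.
            if rng.random() < p:
                adj[i][j] = adj[j][i] = 1
    return adj


def degrees(adj):
    """Degree of every vertex, read off as the row sums of the adjacency matrix."""
    return [sum(row) for row in adj]
```

The matrix is symmetric with zero diagonal by construction, and the degrees are typically not all equal, which is the basic difference from the regular model $G_{n,d}$.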

Given a graph $G$ on $n$ vertices, the adjacency matrix $A$ of $G$ is an $n \times n$ matrix whose entry $a_{ij}$
equals one if there is an edge between the vertices $i$ and $j$ and zero otherwise. All diagonal entries
$a_{ii}$ are defined to be zero. The eigenvalues and eigenvectors of $A$ carry valuable information
about the structure of the graph and have been studied by many researchers for quite some time,
with both theoretical and practical motivations (see, for example, [2], [3], [12], [25], [16], [13], [15], among others).

The goal of this paper is to study the eigenvalues and eigenvectors of $G(n,p)$ and $G_{n,d}$. We
are going to consider:

The global law for the limit of the empirical spectral distribution (ESD) of adjacency
matrices of $G(n,p)$ and $G_{n,d}$. For $p = \omega(1/n)$, it is well-known that the eigenvalues of $G(n,p)$
(after a proper scaling) follow Wigner's semicircle law (we include a short proof in
Appendix A for completeness). Our main new result shows that the same law holds for
the random regular graph with $d \to \infty$ with $n$. This complements the well known result of
McKay for the case when $d$ is an absolute constant (McKay's law) and extends recent
results of Dumitriu and Pal [9] (see Section 1.2 for more discussion).



Bound on the infinity norm of the eigenvectors. We first prove that the infinity norm
of any (unit) eigenvector $v$ of $G(n,p)$ is almost surely $o(1)$ for $p = \omega(\log n/n)$. This
gives a positive answer to a question raised by Dekel, Lee and Linial [7]. Furthermore,
we can show that $v$ satisfies the bound $\|v\|_\infty = O\big(\sqrt{\log^{2.2} g(n) \log n/np}\big)$ for
$p = \omega(\log n/n) = g(n) \log n/n$, as long as the corresponding eigenvalue is bounded away
from the (normalized) extremal values $-2$ and $2$.



We finish this section with some notation and conventions. 

Given an $n \times n$ symmetric matrix $M$, we denote its $n$ eigenvalues as

$$\lambda_1(M) \le \lambda_2(M) \le \dots \le \lambda_n(M),$$

and let $u_1(M), \dots, u_n(M) \in \mathbb{R}^n$ be an orthonormal basis of eigenvectors of $M$ with

$$M u_i(M) = \lambda_i u_i(M).$$

The empirical spectral distribution (ESD) of the matrix $M$ is a one-dimensional function

$$F_n^M(x) = \frac{1}{n} \left| \{ 1 \le j \le n : \lambda_j(M) \le x \} \right|,$$

where we use $|I|$ to denote the cardinality of a set $I$.
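As a small illustration of this definition, the ESD can be evaluated directly from a list of eigenvalues. The sketch below is ours (the helper name `esd` is hypothetical) and assumes the eigenvalues have already been computed by some other means.

```python
from bisect import bisect_right


def esd(eigenvalues):
    """Return the function F(x) = (1/n) * #{j : lambda_j <= x}."""
    sorted_eigs = sorted(eigenvalues)
    n = len(sorted_eigs)

    def F(x):
        # bisect_right counts how many sorted eigenvalues are <= x.
        return bisect_right(sorted_eigs, x) / n

    return F
```

For instance, the adjacency matrix of a single edge on two vertices has eigenvalues $-1$ and $1$, so `esd([-1.0, 1.0])` is a step function with two jumps of height $1/2$.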

Let $A_n$ be the adjacency matrix of $G(n,p)$. Thus $A_n$ is a random symmetric $n \times n$ matrix whose
upper triangular entries are iid copies of a real random variable $\xi$ and diagonal entries are 0.
$\xi$ is a Bernoulli random variable that takes values 1 with probability $p$ and 0 with probability
$1 - p$.

$$E\xi = p, \quad \mathrm{Var}\,\xi = p(1-p) = \sigma^2.$$

Usually it is more convenient to study the normalized matrix

$$M_n = \frac{1}{\sigma}(A_n - pJ_n)$$

where $J_n$ is the $n \times n$ matrix all of whose entries are 1. $M_n$ has entries with mean zero and
variance one. The global properties of the eigenvalues of $A_n$ and $M_n$ are essentially the same
(after proper scaling), thanks to the following lemma.

Lemma 1.1. (Lemma 36, [30]) Let $A, B$ be symmetric matrices of the same size where $B$ has
rank one. Then for any interval $I$,

$$|N_I(A + B) - N_I(A)| \le 1,$$

where $N_I(M)$ is the number of eigenvalues of $M$ in $I$.

Definition 1.2. Let $E$ be an event depending on $n$. Then $E$ holds with overwhelming probability
if $P(E) \ge 1 - \exp(-\omega(\log n))$.

The main advantage of this definition is that if we have a polynomial number of events, each of 
which holds with overwhelming probability, then their intersection also holds with overwhelming 
probability. 
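The closure property just mentioned is a one-line union bound. The derivation below is a sketch under an assumption that we make explicit: the $N \le n^C$ events $E_1, \dots, E_N$ share a common function $h(n) \to \infty$ with $P(E_i^c) \le \exp(-h(n)\log n)$ (the symbols $N$, $C$ and $h$ are ours).

```latex
\[
P\Big(\bigcup_{i=1}^{N} E_i^{c}\Big)
  \le \sum_{i=1}^{N} P(E_i^{c})
  \le n^{C} \exp\big(-h(n)\log n\big)
  = \exp\big(-(h(n) - C)\log n\big)
  = \exp\big(-\omega(\log n)\big),
\]
% since h(n) - C still tends to infinity for any fixed constant C.
```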

Asymptotic notation is used under the assumption that $n \to \infty$. For functions $f$ and $g$ of
parameter $n$, we use the following notation as $n \to \infty$: $f = O(g)$ if $|f|/|g|$ is bounded from
above; $f = o(g)$ if $f/g \to 0$; $f = \omega(g)$ if $|f|/|g| \to \infty$, or equivalently, $g = o(f)$; $f = \Omega(g)$ if
$g = O(f)$; $f = \Theta(g)$ if $f = O(g)$ and $g = O(f)$.



1.2 The semicircle law 

In the 1950s, Wigner [32] discovered the famous semicircle law for the limiting distribution of the
eigenvalues of random matrices. His proof extends, without difficulty, to the adjacency matrix
of $G(n,p)$, given that $np \to \infty$ with $n$. (See Figure 1 for a numerical simulation.)

Theorem 1.3. For $p = \omega(\frac{1}{n})$, the empirical spectral distribution (ESD) of the matrix $\frac{1}{\sqrt{n}\sigma} A_n$
converges in distribution to the semicircle distribution which has a density $\rho_{sc}(x)$ with support
on $[-2,2]$,

$$\rho_{sc}(x) := \frac{1}{2\pi}\sqrt{4 - x^2}.$$

If $np = O(1)$, the semicircle law no longer holds. In this case, the graph almost surely has
$\Theta(n)$ isolated vertices, so in the limiting distribution, the point 0 will have positive constant
mass.
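To see why a constant fraction of vertices is isolated when $np$ stays bounded, note that a fixed vertex of $G(n,p)$ has no neighbours with probability $(1-p)^{n-1}$. The sketch below (our own illustration; the name `isolated_probability` and the parametrization $p = c/n$ are assumptions for the example) evaluates this quantity and compares it with its limit $e^{-c}$.

```python
import math


def isolated_probability(n, c):
    """Probability that a fixed vertex of G(n, p) with p = c/n has no neighbours."""
    p = c / n
    # The vertex is isolated iff none of its n - 1 potential edges is present.
    return (1.0 - p) ** (n - 1)


if __name__ == "__main__":
    for n in (10**2, 10**4, 10**6):
        print(n, isolated_probability(n, 2.0), math.exp(-2.0))
```

By linearity of expectation the expected number of isolated vertices is then about $e^{-c} n$, which is of order $n$.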

The case of the random regular graph, $G_{n,d}$, was considered by McKay [21] about 30 years ago.
He proved that if $d$ is fixed, and $n \to \infty$, then the limiting density function is

$$f_d(x) = \begin{cases} \dfrac{d\sqrt{4(d-1) - x^2}}{2\pi(d^2 - x^2)} & \text{if } |x| \le 2\sqrt{d-1}; \\ 0 & \text{otherwise.} \end{cases}$$

This is usually referred to as the McKay or Kesten-McKay law.

It is easy to verify that as $d \to \infty$, if we normalize the variable $x$ by $\sqrt{d-1}$, then the above
density converges to the semicircle distribution on $[-2, 2]$. In fact, a numerical simulation shows
the convergence is quite fast (see Figure 2).
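The convergence of the rescaled Kesten-McKay density to the semicircle density can be checked numerically. The sketch below is our own illustration, not the simulation behind Figure 2; it uses the standard closed form $f_d(x) = d\sqrt{4(d-1)-x^2}/(2\pi(d^2-x^2))$ on $|x| \le 2\sqrt{d-1}$, and the function names are hypothetical.

```python
import math


def semicircle_density(y):
    """Semicircle density on [-2, 2]."""
    if abs(y) >= 2.0:
        return 0.0
    return math.sqrt(4.0 - y * y) / (2.0 * math.pi)


def kesten_mckay_density(x, d):
    """Kesten-McKay density for degree d, supported on |x| <= 2*sqrt(d-1)."""
    if abs(x) >= 2.0 * math.sqrt(d - 1):
        return 0.0
    inside = max(0.0, 4.0 * (d - 1) - x * x)  # guard against rounding
    return d * math.sqrt(inside) / (2.0 * math.pi * (d * d - x * x))


def rescaled_kesten_mckay(y, d):
    """Density of x / sqrt(d-1) when x follows the Kesten-McKay law."""
    s = math.sqrt(d - 1)
    return s * kesten_mckay_density(s * y, d)


def sup_distance(d, grid=2001):
    """Largest gap between the rescaled density and the semicircle on a grid."""
    ys = [-2.0 + 4.0 * i / (grid - 1) for i in range(grid)]
    return max(abs(rescaled_kesten_mckay(y, d) - semicircle_density(y)) for y in ys)
```

Evaluating `sup_distance(d)` for increasing `d` shows the gap shrinking roughly like $1/d$, in line with the remark that the convergence is fast.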



It is thus natural to conjecture that Theorem 1.3 holds for $G_{n,d}$ with $d \to \infty$. Let $A'_n$ be the
adjacency matrix of $G_{n,d}$ and set

$$M'_n = \frac{1}{\sqrt{\frac{d}{n}\left(1 - \frac{d}{n}\right)}}\left(A'_n - \frac{d}{n}J\right).$$

Conjecture 1.4. If $d \to \infty$ then the ESD of $\frac{1}{\sqrt{n}}M'_n$ converges to the standard semicircle
distribution.



Nothing had been proved about this conjecture until recently. In [9], Dumitriu and Pal showed
that the conjecture holds for $d$ tending to infinity slowly, $d = n^{o(1)}$. Their method does not
extend to larger $d$.

Figure 2: The probability density function of the ESD of
random $d$-regular graphs with 1000 vertices

We are going to establish Conjecture 1.4 in full generality. Our method is very different from
that of [9].

Without loss of generality we may assume $d \le n/2$, since the adjacency matrix of the complement
graph of $G_{n,d}$ may be written as $J_n - A'_n$, thus by Lemma 1.1 will have the spectrum
interlacing between the set $\{-\lambda_n(A'_n), \dots, -\lambda_1(A'_n)\}$. Since the semi-circular distribution is
symmetric, the ESD of $G_{n,d}$ will converge to the semi-circular law if and only if the ESD of its
complement does.

Theorem 1.5. If $d$ tends to infinity with $n$, then the empirical spectral distribution of $\frac{1}{\sqrt{n}}M'_n$
converges in distribution to the semicircle distribution.

Theorem 1.5 is a direct consequence of the following stronger result, which shows convergence
at small scales. For an interval $I$ let $N_I$ be the number of eigenvalues of $M'_n$ in $I$.

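Theorem 1.6. (Concentration for ESD of $G_{n,d}$). Let $\delta > 0$ and consider the model $G_{n,d}$. If $d$
tends to $\infty$ as $n \to \infty$ then for any interval $I \subset [-2,2]$ with length at least $\delta^{-4/5} d^{-1/10} \log^{1/5} d$,
we have

$$\left| N_I - n\int_I \rho_{sc}(x)\,dx \right| < \delta n \int_I \rho_{sc}(x)\,dx$$

with probability at least $1 - O(\exp(-cn\sqrt{d}\log d))$.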



Remark 1.7. Theorem 1.6 implies that with probability $1 - o(1)$, for $d = n^{\Theta(1)}$, the rank of
$G_{n,d}$ is at least $n - n^c$ for some constant $0 < c < 1$ (which can be computed explicitly from the
lemmas). This is a partial result toward the conjecture by the second author that $G_{n,d}$ almost
surely has full rank (see [31]).

1.3 Infinity norm of the eigenvectors 

Relatively little is known for eigenvectors in both random graph models under study. In [7], 
Dekel, Lee and Linial, motivated by the study of nodal domains, raised the following question. 

Question 1.8. Is it true that almost surely every eigenvector $u$ of $G(n,p)$ has $\|u\|_\infty = o(1)$?

Later, in their journal paper [8], the authors added one sharper question.

Question 1.9. Is it true that almost surely every eigenvector $u$ of $G(n,p)$ has $\|u\|_\infty = n^{-1/2+o(1)}$?

The bound $n^{-1/2+o(1)}$ was also conjectured by the second author of this paper in an NSF proposal
(submitted Oct 2008). He and Tao [30] proved this bound for eigenvectors corresponding
to the eigenvalues in the bulk of the spectrum for the case $p = 1/2$. If one defines the adjacency
matrix by writing $-1$ for non-edges, then this bound holds for all eigenvectors [30], [29].

The above two questions were raised under the assumption that $p$ is a constant in the interval
$(0, 1)$. For $p$ depending on $n$, the statements may fail. If $p \le \frac{(1-\epsilon)\log n}{n}$, then the graph has
(with high probability) isolated vertices and so one cannot expect that $\|u\|_\infty = o(1)$ for every
eigenvector $u$. We raise the following questions:

Question 1.10. Assume $p \ge \frac{(1+\epsilon)\log n}{n}$ for some constant $\epsilon > 0$. Is it true that almost surely
every eigenvector $u$ of $G(n,p)$ has $\|u\|_\infty = o(1)$?

Question 1.11. Assume $p \ge \frac{(1+\epsilon)\log n}{n}$ for some constant $\epsilon > 0$. Is it true that almost surely
every eigenvector $u$ of $G(n,p)$ has $\|u\|_\infty = n^{-1/2+o(1)}$?

Similarly, we can ask the above questions for $G_{n,d}$:

Question 1.12. Assume $d \ge (1 + \epsilon)\log n$ for some constant $\epsilon > 0$. Is it true that almost surely
every eigenvector $u$ of $G_{n,d}$ has $\|u\|_\infty = o(1)$?

Question 1.13. Assume $d \ge (1 + \epsilon)\log n$ for some constant $\epsilon > 0$. Is it true that almost surely
every eigenvector $u$ of $G_{n,d}$ has $\|u\|_\infty = n^{-1/2+o(1)}$?
As far as random regular graphs are concerned, Dumitriu and Pal [9] and Brooks and Lindenstrauss
[5] showed that any normalized eigenvector of a sparse random regular graph is
delocalized in the sense that one cannot have too much mass on a small set of coordinates.
The readers may want to consult their papers for explicit statements.

We generalize our questions by the following conjectures: 

Conjecture 1.14. Assume $p \ge \frac{(1+\epsilon)\log n}{n}$ for some constant $\epsilon > 0$. Let $v$ be a random unit
vector whose distribution is uniform in the $(n-1)$-dimensional unit sphere. Let $u$ be a unit
eigenvector of $G(n,p)$ and $w$ be any fixed $n$-dimensional vector. Then for any $\delta > 0$

$$P(|w \cdot u - w \cdot v| > \delta) = o(1).$$

Conjecture 1.15. Assume $d \ge (1 + \epsilon)\log n$ for some constant $\epsilon > 0$. Let $v$ be a random unit
vector whose distribution is uniform in the $(n-1)$-dimensional unit sphere. Let $u$ be a unit
eigenvector of $G_{n,d}$ and $w$ be any fixed $n$-dimensional vector. Then for any $\delta > 0$

$$P(|w \cdot u - w \cdot v| > \delta) = o(1).$$



In this paper, we focus on $G(n,p)$. Our main result settles (positively) Question 1.8 and almost
Question 1.10. This result follows from Corollary 2.3 obtained in Section 2.

Theorem 1.16. (Infinity norm of eigenvectors) Let $p = \omega(\log n/n)$ and let $A_n$ be the adjacency
matrix of $G(n,p)$. Then there exists an orthonormal basis of eigenvectors of $A_n$, $\{u_1, \dots, u_n\}$,
such that for every $1 \le i \le n$, $\|u_i\|_\infty = o(1)$ almost surely.



For Questions 1.9 and 1.11, we obtain a good quantitative bound for those eigenvectors which 
correspond to eigenvalues bounded away from the edge of the spectrum. 

For convenience, in the case when $p = \omega(\log n/n) \in (0, 1)$, we write

$$p = \frac{g(n)\log n}{n}$$

where $g(n)$ is a positive function such that $g(n) \to \infty$ as $n \to \infty$ ($g(n)$ can tend to $\infty$ arbitrarily
slowly).

Theorem 1.17. Assume $p = g(n)\log n/n \in (0,1)$, where $g(n)$ is defined as above. Let
$B_n = \frac{1}{\sqrt{n}\sigma}A_n$. For any $\kappa > 0$, and any $1 \le i \le n$ with $\lambda_i(B_n) \in [-2 + \kappa, 2 - \kappa]$, there
exists a corresponding eigenvector $u_i$ such that

$$\|u_i\|_\infty = O_\kappa\left(\sqrt{\frac{\log^{2.2} g(n) \log n}{np}}\right)$$

with overwhelming probability.



The proofs are adaptations of a recent approach developed in random matrix theory (as in
[30], [29], [10], [11]). The main technical lemma is a concentration theorem about the number of
eigenvalues on a finer scale for $p = \omega(\log n/n)$.

2 Semicircle law for regular random graphs

2.1 Proof of Theorem 1.6



We use the method of comparison. An important lemma is the following 

Lemma 2.1. If $np \to \infty$ then $G(n,p)$ is $np$-regular with probability at least $\exp(-O(n(np)^{1/2}))$.



For the range $p \ge \log^2 n/n$, Lemma 2.1 is a consequence of a result of Shamir and Upfal [26]
(see also [20]). For smaller values of $np$, McKay and Wormald [23] calculated precisely the
probability that $G(n,p)$ is $np$-regular, using the fact that the joint distribution of the degree
sequence of $G(n,p)$ can be approximated by a simple model derived from independent random
variables with binomial distribution. Alternatively, one may calculate the same probability
directly using the asymptotic formula for the number of $d$-regular graphs on $n$ vertices (again
by McKay and Wormald [22]). Either way, for $p = o(1/\sqrt{n})$, we know that

$$P(G(n,p) \text{ is } np\text{-regular}) \ge \Theta(\exp(-n\log(\sqrt{np}))),$$

which is better than claimed in Lemma 2.1.



Another key ingredient is the following concentration lemma, which may be of independent 
interest. 

Lemma 2.2. Let $M$ be an $n \times n$ Hermitian random matrix whose off-diagonal entries $\xi_{ij}$ are i.i.d.
random variables with mean zero, variance 1 and $|\xi_{ij}| < K$ for some common constant $K$. Fix
$\delta > 0$ and assume that the fourth moment $M_4 := \sup_{i,j} E(|\xi_{ij}|^4) = o(n)$. Then for any interval
$I \subset [-2,2]$ whose length is at least $\Omega(\delta^{-2/3}(M_4/n)^{1/3})$, the number $N_I$ of the eigenvalues of
$\frac{1}{\sqrt{n}}M$ which belong to $I$ satisfies the following concentration inequality

$$P\left(\left|N_I - n\int_I \rho_{sc}(t)\,dt\right| > \delta n\int_I \rho_{sc}(t)\,dt\right) \le 4\exp\left(-c\,\frac{\delta^4 n^2 |I|^5}{K^2}\right).$$

Applying Lemma 2.2 to the normalized adjacency matrix $M_n$ of $G(n,p)$ with $K = 1/\sqrt{p}$, we
obtain

Corollary 2.3. Consider the model $G(n,p)$ with $np \to \infty$ as $n \to \infty$ and let $\delta > 0$. Then for
any interval $I \subset [-2,2]$ with length at least $\left(\frac{\log(np)}{\delta^4 (np)^{1/2}}\right)^{1/5}$, we have

$$\left|N_I - n\int_I \rho_{sc}(x)\,dx\right| \ge \delta n\int_I \rho_{sc}(x)\,dx$$

with probability at most $\exp(-cn(np)^{1/2}\log(np))$.
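For the reader's convenience, here is a short sketch of how the exponent in Corollary 2.3 comes out of Lemma 2.2, assuming the tail bound of Lemma 2.2 has the form $4\exp(-c\,\delta^4 n^2 |I|^5/K^2)$ and using only $K = 1/\sqrt{p}$ together with the stated lower bound on $|I|$.

```latex
\[
\frac{\delta^{4} n^{2} |I|^{5}}{K^{2}}
  = \delta^{4} n^{2} p\, |I|^{5}
  \ge \delta^{4} n^{2} p \cdot \frac{\log(np)}{\delta^{4} (np)^{1/2}}
  = n \cdot \frac{np}{(np)^{1/2}} \cdot \log(np)
  = n\,(np)^{1/2} \log(np).
\]
% Hence 4 exp(-c delta^4 n^2 |I|^5 / K^2) <= exp(-c' n (np)^{1/2} log(np))
% for a slightly smaller constant c' > 0 once n is large.
```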
Remark 2.4. If one only needs the result for the bulk case $I \subset [-2 + \epsilon, 2 - \epsilon]$ for an absolute
constant $\epsilon > 0$ then the minimum length of $I$ can be improved to $\left(\frac{\log(np)}{\delta^4 (np)^{1/2}}\right)^{1/4}$.



By Corollary 2.3 and Lemma 2.1, the probability that $N_I$ fails to be close to the expected value
in the model $G(n,p)$ is much smaller than the probability that $G(n,p)$ is $np$-regular. Thus the
probability that $N_I$ fails to be close to the expected value in the model $G_{n,d}$ where $d = np$ is at most the
ratio of the two former probabilities, which is $O(\exp(-cn\sqrt{np}\log np))$ for some small positive
constant $c$. Thus, Theorem 1.6 is proved, depending on Lemma 2.2 which we turn to next.



2.2 Proof of Lemma 2.2 



Assume $I = [a, b]$ and $a - (-2) < 2 - b$.

We will use the approach of Guionnet and Zeitouni in [18]. Consider a random Hermitian
matrix $W_n$ with independent entries $\omega_{ij}$ with support in a compact region $S$. Let $f$ be a real
convex $L$-Lipschitz function and define

$$Z := \sum_{i=1}^{n} f(\lambda_i)$$

where the $\lambda_i$'s are the eigenvalues of $\frac{1}{\sqrt{n}}W_n$. We are going to view $Z$ as a function of the atom
variables $\omega_{ij}$. For our application we need $\omega_{ij}$ to be random variables with mean zero and
variance 1, whose absolute values are bounded by a common constant $K$.



The following concentration inequality is from [18].

Lemma 2.5. Let $W_n$, $f$, $Z$ be as above. Then there is a constant $c > 0$ such that for any $T > 0$

$$P(|Z - E(Z)| \ge T) \le 4\exp\left(-c\,\frac{T^2}{K^2 L^2}\right).$$



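In order to apply Lemma 2.5 for $N_I$ and $M$, it is natural to consider

$$Z := N_I = \sum_{i=1}^{n} \chi_I(\lambda_i)$$

where $\chi_I$ is the indicator function of $I$ and $\lambda_i$ are the eigenvalues of $\frac{1}{\sqrt{n}}M_n$. However, this
function is neither convex nor Lipschitz. As suggested in [18], one can overcome this problem
by a proper approximation. Define $I_l = [a - \frac{|I|}{C}, a]$, $I_r = [b, b + \frac{|I|}{C}]$ and construct two real
functions $f_1, f_2$ as follows (see Figure 3):

$$f_1(x) = \begin{cases} -\frac{C}{|I|}(x - a) - 1 & \text{if } x \in (-\infty, a - \frac{|I|}{C}) \\ 0 & \text{if } x \in I \cup I_l \cup I_r \\ \frac{C}{|I|}(x - b) - 1 & \text{if } x \in (b + \frac{|I|}{C}, \infty) \end{cases}$$

$$f_2(x) = \begin{cases} -\frac{C}{|I|}(x - a) - 1 & \text{if } x \in (-\infty, a) \\ -1 & \text{if } x \in I \\ \frac{C}{|I|}(x - b) - 1 & \text{if } x \in (b, \infty) \end{cases}$$

where $C$ is a constant to be chosen later. Note that the $f_j$'s are convex and $\frac{C}{|I|}$-Lipschitz. Define

$$X_1 = \sum_{i=1}^{n} f_1(\lambda_i), \quad X_2 = \sum_{i=1}^{n} f_2(\lambda_i)$$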

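and apply Lemma 2.5 with $T = \frac{\delta}{8} n \int_I \rho_{sc}(t)\,dt$ for $X_1$ and $X_2$. Thus, we have

$$P\left(|X_j - E(X_j)| \ge \frac{\delta}{8} n\int_I \rho_{sc}(t)\,dt\right) \le 4\exp\left(-c\,\frac{\delta^2 n^2 |I|^2 \left(\int_I \rho_{sc}(t)\,dt\right)^2}{K^2 C^2}\right).$$

At this point we need to estimate the value of $\int_I \rho_{sc}(t)\,dt$. There are two cases: if $I$ is in the
"bulk" i.e. $I \subset [-2 + \epsilon, 2 - \epsilon]$ for some positive absolute constant $\epsilon$, then $\int_I \rho_{sc}(t)\,dt = \alpha|I|$ where
$\alpha$ is a constant depending on $\epsilon$. But if $I$ is very near the edge of $[-2, 2]$ i.e. $a - (-2) < |I| = o(1)$,
then $\int_I \rho_{sc}(t)\,dt = \alpha'|I|^{3/2}$ for some absolute constant $\alpha'$. Thus in both cases we have

$$P\left(|X_j - E(X_j)| \ge \frac{\delta}{8} n\int_I \rho_{sc}(t)\,dt\right) \le 4\exp\left(-c_1\frac{\delta^2 n^2 |I|^5}{K^2 C^2}\right).$$

Let $X = X_1 - X_2$, then

$$P\left(|X - E(X)| \ge \frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt\right) \le O\left(\exp\left(-c_1\frac{\delta^2 n^2 |I|^5}{K^2 C^2}\right)\right).$$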
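The role of the two auxiliary functions can be checked numerically: assuming the piecewise definitions $f_1 = \max(0, \text{outer ramps})$ and $f_2 = \max(-1, \text{inner ramps})$ described in the proof, the difference $f_1 - f_2$ dominates the indicator of $I$ and vanishes outside $I \cup I_l \cup I_r$. The sketch below is our own illustration (names such as `make_upper_approximants` are hypothetical).

```python
def make_upper_approximants(a, b, C):
    """Return (f1, f2) for the interval I = [a, b] and a constant C > 0."""
    length = b - a
    slope = C / length   # Lipschitz constant C / |I|
    pad = length / C     # width of the side intervals I_l and I_r

    def f1(x):
        if x < a - pad:
            return -slope * (x - a) - 1.0
        if x > b + pad:
            return slope * (x - b) - 1.0
        return 0.0       # on I, I_l and I_r

    def f2(x):
        if x < a:
            return -slope * (x - a) - 1.0
        if x > b:
            return slope * (x - b) - 1.0
        return -1.0      # on I

    return f1, f2


def indicator(a, b, x):
    """Indicator function of the closed interval [a, b]."""
    return 1.0 if a <= x <= b else 0.0
```

Summing $f_1 - f_2$ over the eigenvalues therefore gives a quantity between $N_I$ and $N_I + N_{I_l} + N_{I_r}$, which is how the comparison between $X$ and $Z$ is used in the argument.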

Now we compare $X$ to $Z$, making use of a result of Götze and Tikhomirov [17]. We have
$E(X - Z) \le E(N_{I_l} + N_{I_r})$. In [17], Götze and Tikhomirov obtained a convergence rate for the ESD
of Hermitian random matrices whose entries have mean zero and variance one, which implies
that for any $I \subset [-2, 2]$

$$\left|E(N_I) - n\int_I \rho_{sc}(t)\,dt\right| < \beta n\sqrt{\frac{M_4}{n}},$$

where $\beta$ is an absolute constant, $M_4 = \sup_{i,j} E(|\omega_{ij}|^4)$. Thus

$$E(X) \le E(Z) + n\int_{I_l \cup I_r} \rho_{sc}(t)\,dt + \beta n\sqrt{\frac{M_4}{n}}.$$

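In the "edge" case we can choose $C = (4/\delta)^{2/3}$, then because $|I| \ge \Omega(\delta^{-2/3}(M_4/n)^{1/3})$, we have

$$n\int_{I_l \cup I_r} \rho_{sc}(t)\,dt = \Theta\left(n\left(\frac{|I|}{C}\right)^{3/2}\right) \ge \Omega\left(n\sqrt{\frac{M_4}{n}}\right)$$

and

$$n\int_{I_l \cup I_r} \rho_{sc}(t)\,dt + \beta n\sqrt{\frac{M_4}{n}} = \Theta\left(n\left(\frac{|I|}{C}\right)^{3/2}\right) = \Theta\left(\frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt\right).$$

In the "bulk" case we choose $C = 4/\delta$, then

$$n\int_{I_l \cup I_r} \rho_{sc}(t)\,dt + \beta n\sqrt{\frac{M_4}{n}} = \Theta\left(\frac{n|I|}{C}\right) = \Theta\left(\frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt\right).$$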

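Therefore in both cases, with probability at least $1 - O(\exp(-c_1\frac{\delta^4 n^2 |I|^5}{K^2}))$, we have

$$Z \le X \le E(X) + \frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt \le E(Z) + \frac{\delta}{2} n\int_I \rho_{sc}(t)\,dt.$$

The convergence rate result of Götze and Tikhomirov again gives

$$E(N_I) \le n\int_I \rho_{sc}(t)\,dt + \beta n\sqrt{\frac{M_4}{n}} \le \left(1 + \frac{\delta}{2}\right) n\int_I \rho_{sc}(t)\,dt,$$

hence with probability at least $1 - O(\exp(-c_1\frac{\delta^4 n^2 |I|^5}{K^2}))$

$$Z \le (1 + \delta) n\int_I \rho_{sc}(t)\,dt,$$

which is the desired upper bound.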



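The lower bound is proved using a similar argument. Let $I' = [a + \frac{|I|}{C}, b - \frac{|I|}{C}]$, $I'_l = [a, a + \frac{|I|}{C}]$,
$I'_r = [b - \frac{|I|}{C}, b]$ where $C$ is to be chosen later and define two functions $g_1, g_2$ as follows (see
Figure 3):

$$g_1(x) = \begin{cases} -\frac{C}{|I|}(x - a) & \text{if } x \in (-\infty, a) \\ 0 & \text{if } x \in I' \cup I'_l \cup I'_r \\ \frac{C}{|I|}(x - b) & \text{if } x \in (b, \infty) \end{cases}$$

$$g_2(x) = \begin{cases} -\frac{C}{|I|}(x - a) & \text{if } x \in (-\infty, a + \frac{|I|}{C}) \\ -1 & \text{if } x \in I' \\ \frac{C}{|I|}(x - b) & \text{if } x \in (b - \frac{|I|}{C}, \infty) \end{cases}$$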

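Define

$$Y_1 = \sum_{i=1}^{n} g_1(\lambda_i), \quad Y_2 = \sum_{i=1}^{n} g_2(\lambda_i).$$

Applying Lemma 2.5 with $T = \frac{\delta}{8} n\int_I \rho_{sc}(t)\,dt$ for $Y_j$ and using the estimation for $\int_I \rho_{sc}(t)\,dt$ as
above, we have

$$P\left(|Y_j - E(Y_j)| \ge \frac{\delta}{8} n\int_I \rho_{sc}(t)\,dt\right) \le 4\exp\left(-c_2\frac{\delta^2 n^2 |I|^5}{K^2 C^2}\right).$$

Let $Y = Y_1 - Y_2$, then

$$P\left(|Y - E(Y)| \ge \frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt\right) \le O\left(\exp\left(-c_2\frac{\delta^2 n^2 |I|^5}{K^2 C^2}\right)\right).$$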



We have $E(Z - Y) \le E(N_{I'_l} + N_{I'_r})$. A similar argument as in the proof of the upper bound
(using the convergence rate of Götze and Tikhomirov) shows

$$E(Y) \ge E(Z) - n\int_{I'_l \cup I'_r} \rho_{sc}(t)\,dt - \beta n\sqrt{\frac{M_4}{n}} \ge E(Z) - \frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt.$$

Therefore with probability at least $1 - O(\exp(-c_2\frac{\delta^4 n^2 |I|^5}{K^2}))$, we have

$$Z \ge Y \ge E(Y) - \frac{\delta}{4} n\int_I \rho_{sc}(t)\,dt \ge E(Z) - \frac{\delta}{2} n\int_I \rho_{sc}(t)\,dt,$$

and by the convergence rate, with probability at least $1 - O(\exp(-c_2\frac{\delta^4 n^2 |I|^5}{K^2}))$

$$Z \ge (1 - \delta) n\int_I \rho_{sc}(t)\,dt.$$

Thus, Lemma 2.2 is proved.

Figure 3: Auxiliary functions used in the proof



3 Infinity norm of the eigenvectors 



3.1 Small perturbation lemma 



$A_n$ is the adjacency matrix of $G(n,p)$. In the proofs of Theorem 1.16 and Theorem 1.17, we
actually work with the eigenvectors of a perturbed matrix

$$A_n + \epsilon N_n,$$

where $\epsilon = \epsilon(n) > 0$ can be arbitrarily small and $N_n$ is a symmetric random matrix whose upper
triangular elements are independent with a standard Gaussian distribution.

The entries of $A_n + \epsilon N_n$ are continuous and thus with probability 1, the eigenvalues of $A_n + \epsilon N_n$
are simple. Let

$$\mu_1 < \dots < \mu_n$$

be the ordered eigenvalues of $A_n + \epsilon N_n$, which have a unique orthonormal system of eigenvectors
$\{w_1, \dots, w_n\}$. By the Cauchy interlacing principle, the eigenvalues of $A_n + \epsilon N_n$ are different
from those of its principal minors, which satisfies a condition of Lemma 3.2.

Let the $\lambda_i$'s be the eigenvalues of $A_n$, with multiplicity $k_i$ defined as follows:

$$\dots \lambda_{i-1} < \lambda_i = \lambda_{i+1} = \dots = \lambda_{i+k_i} < \lambda_{i+k_i+1} \dots$$

By Weyl's theorem, one has for every $1 \le j \le n$,

$$|\lambda_j - \mu_j| \le \epsilon\|N_n\|_{op} = O(\epsilon\sqrt{n}). \qquad (3.1)$$

Thus the behaviors of eigenvalues of $A_n$ and $A_n + \epsilon N_n$ are essentially the same by choosing $\epsilon$
sufficiently small. And everything (except Lemma 3.2) we used in the proofs of Theorem 1.16
and Theorem 1.17 for $A_n$ also applies for $A_n + \epsilon N_n$ by a continuity argument. We will not
distinguish $A_n$ from $A_n + \epsilon N_n$ in the proofs.

The following lemma will allow us to transfer the eigenvector delocalization results of $A_n + \epsilon N_n$
to those of $A_n$ at some expense.

Lemma 3.1. In the notation above, there exists an orthonormal basis of eigenvectors of
$A_n$, denoted by $\{u_1, \dots, u_n\}$, such that for every $1 \le j \le n$,

$$\|u_j\|_\infty \le \|w_j\|_\infty + \alpha(n),$$

where $\alpha(n)$ can be arbitrarily small provided $\epsilon(n)$ is small enough.

Proof. First, since the coefficients of the characteristic polynomial of $A_n$ are integers, there
exists a positive function $l(n)$ such that either $|\lambda_s - \lambda_t| = 0$ or $|\lambda_s - \lambda_t| \ge l(n)$ for any
$1 \le s, t \le n$.

By (3.1) and choosing $\epsilon$ sufficiently small, one can get

$$|\mu_i - \lambda_{i-1}| > l(n) \quad \text{and} \quad |\mu_{i+k_i} - \lambda_{i+k_i+1}| > l(n).$$

For a fixed index $i$, let $E$ be the eigenspace corresponding to the eigenvalue $\lambda_i$ and $F$ be the
subspace spanned by $\{w_i, \dots, w_{i+k_i}\}$. Both $E$ and $F$ have dimension $k_i$. Let $P_E$ and $P_F$ be
the orthogonal projection matrices onto $E$ and $F$ respectively.

Applying the well-known Davis-Kahan theorem (see [28], Section IV, Theorem 3.6) to $A_n$ and
$A_n + \epsilon N_n$, one gets

$$\|P_E - P_F\|_{op} \le \frac{\epsilon\|N_n\|_{op}}{l(n)} := \alpha(n),$$

where $\alpha(n)$ can be arbitrarily small depending on $\epsilon$.

Define $v_j = P_F w_j \in E$ for $i \le j \le i + k_i$, then we have $\|v_j - w_j\|_2 \le \alpha(n)$. It is clear that
$\{v_1, \dots, v_k\}$ are eigenvectors of $A_n$ and

$$\|v_j\|_\infty \le \|w_j\|_\infty + \|v_j - w_j\|_2 \le \|w_j\|_\infty + \alpha(n).$$

By choosing $\epsilon$ small enough such that $n\alpha(n) < 1/2$, $\{v_1, \dots, v_k\}$ are linearly independent.
Indeed, if $\sum_{j=1}^{k} c_j v_j = 0$, one has for every $i \le s \le i + k_i$, $\sum_{j=1}^{k} c_j\langle P_F w_j, w_s\rangle = 0$, which implies
$c_s = -\sum_{j=1}^{k} c_j\langle P_F w_j - w_j, w_s\rangle$. Thus $|c_s| \le \alpha(n)\sum_{j=1}^{k}|c_j|$; summing over all $s$, we get
$\sum_{j=1}^{k}|c_j| \le k\alpha(n)\sum_{j=1}^{k}|c_j|$ and therefore $c_j = 0$.
Furthermore the set $\{v_1, \dots, v_k\}$ is 'almost' an orthonormal basis of $E$ in the sense that

$$\big|\,\|v_s\|_2 - 1\,\big| \le \|v_s - w_s\|_2 \le \alpha(n) \quad \text{for any } i \le s \le i + k_i,$$

$$|\langle v_s, v_t\rangle| = |\langle P_F w_s, P_F w_t\rangle| = |\langle P_F w_s - w_s, P_F w_t\rangle + \langle w_s, P_F w_t - w_t\rangle| = O(\alpha(n)) \quad \text{for any } i \le s \ne t \le i + k_i.$$

We can perform a Gram-Schmidt process on $\{v_1, \dots, v_k\}$ to get an orthonormal system of
eigenvectors $\{u_1, \dots, u_k\}$ on $E$ such that

$$\|u_j\|_\infty \le \|w_j\|_\infty + \alpha(n),$$

for every $i \le j \le i + k_i$.

We iterate the above argument for every distinct eigenvalue of A n to obtain an orthonormal 
basis of eigenvectors of A n . 

□ 



3.2 Auxiliary lemmas 
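Lemma 3.2. (Lemma 41, [30]) Let

$$B_n = \begin{pmatrix} a & X^* \\ X & B_{n-1} \end{pmatrix}$$

be an $n \times n$ symmetric matrix for some $a \in \mathbb{C}$ and $X \in \mathbb{C}^{n-1}$, and let $\begin{pmatrix} x \\ v \end{pmatrix}$ be an eigenvector of
$B_n$ with eigenvalue $\lambda_i(B_n)$, where $x \in \mathbb{C}$ and $v \in \mathbb{C}^{n-1}$. Suppose that none of the eigenvalues
of $B_{n-1}$ are equal to $\lambda_i(B_n)$. Then

$$|x|^2 = \frac{1}{1 + \sum_{j=1}^{n-1}(\lambda_j(B_{n-1}) - \lambda_i(B_n))^{-2}\,|u_j(B_{n-1})^* X|^2},$$

where $u_j(B_{n-1})$ is a unit eigenvector corresponding to the eigenvalue $\lambda_j(B_{n-1})$.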
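A quick sanity check of this identity is possible in the smallest case $n = 2$, where $B_{n-1} = (c)$ is a $1 \times 1$ matrix with unit eigenvector $u_1 = 1$ and $X = b$. The sketch below is our own illustration with hypothetical helper names; it uses the closed-form eigenpairs of a real symmetric $2 \times 2$ matrix with $b \ne 0$.

```python
import math


def eigenpairs_2x2(a, b, c):
    """Eigenvalues and unit eigenvectors of [[a, b], [b, c]], assuming b != 0."""
    disc = math.sqrt((a - c) ** 2 + 4.0 * b * b)
    pairs = []
    for lam in ((a + c - disc) / 2.0, (a + c + disc) / 2.0):
        # (b, lam - a) solves the first row of (B - lam * I) w = 0.
        norm = math.hypot(b, lam - a)
        pairs.append((lam, (b / norm, (lam - a) / norm)))
    return pairs


def first_coordinate_squared(b, c, lam):
    """Right-hand side of the identity with B_{n-1} = (c), u_1 = 1 and X = b."""
    return 1.0 / (1.0 + b * b / (c - lam) ** 2)
```

For each eigenpair `(lam, (x, v))` returned by `eigenpairs_2x2`, the value `x * x` agrees with `first_coordinate_squared(b, c, lam)` up to rounding.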

The Stieltjes transform $s_n(z)$ of a symmetric matrix $W$ is defined for $z \in \mathbb{C}$ by the formula

$$s_n(z) := \frac{1}{n}\sum_{i=1}^{n} \frac{1}{\lambda_i(W) - z}.$$

It has the following alternate representation:

Lemma 3.3. (Lemma 39, [30]) Let $W = (\zeta_{ij})_{1 \le i,j \le n}$ be a symmetric matrix, and let $z$ be a
complex number not in the spectrum of $W$. Then we have

$$s_n(z) = \frac{1}{n}\sum_{k=1}^{n} \frac{1}{\zeta_{kk} - z - a_k^*(W_k - zI)^{-1}a_k}$$

where $W_k$ is the $(n-1) \times (n-1)$ matrix with the $k$-th row and column of $W$ removed, and
$a_k \in \mathbb{C}^{n-1}$ is the $k$-th column of $W$ with the $k$-th entry removed.
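The two expressions for the Stieltjes transform can be compared directly in the $2 \times 2$ case, where $W_k$ and $a_k$ are scalars. The sketch below is our own illustration (function names are hypothetical) for a real symmetric matrix $W = \begin{pmatrix} a & b \\ b & c \end{pmatrix}$ and a non-real $z$.

```python
import math


def stieltjes_from_eigenvalues(a, b, c, z):
    """(1/n) * sum of 1 / (lambda_i - z) for W = [[a, b], [b, c]], n = 2."""
    disc = math.sqrt((a - c) ** 2 + 4.0 * b * b)
    eigs = ((a + c - disc) / 2.0, (a + c + disc) / 2.0)
    return sum(1.0 / (lam - z) for lam in eigs) / 2.0


def stieltjes_from_schur(a, b, c, z):
    """Same quantity via the row/column-removal formula of Lemma 3.3."""
    # k = 1: zeta_11 = a, W_1 = (c), a_1 = (b).
    term1 = 1.0 / (a - z - b * b / (c - z))
    # k = 2: zeta_22 = c, W_2 = (a), a_2 = (b).
    term2 = 1.0 / (c - z - b * b / (a - z))
    return (term1 + term2) / 2.0
```

Both functions return the same complex number up to rounding, for example at `z = 0.3 + 0.7j`.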



We begin with two lemmas that will be needed to prove the main results. The first lemma,
following Appendix B of the paper [30], uses Talagrand's inequality. Its proof is presented in
Appendix B.

Lemma 3.4. Let $Y = (\zeta_1, \dots, \zeta_n) \in \mathbb{C}^n$ be a random vector whose entries are i.i.d. copies of
the random variable $\zeta = \xi - p$ (with mean 0 and variance $\sigma^2$). Let $H$ be a subspace of dimension
$d$ and $\pi_H$ the orthogonal projection onto $H$. Then

P(| || rt H (Y) || -aVd\ >t)< 10exp(--). 

In particular,

$$\|\pi_H(Y)\| = \sigma\sqrt{d} + O(\omega(\sqrt{\log n})) \qquad (3.2)$$

with overwhelming probability.
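The statement that $\|\pi_H(Y)\|$ concentrates around $\sigma\sqrt{d}$ can be illustrated in the simplest special case, where $H$ is spanned by the first $d$ coordinate vectors so that the projection just keeps the first $d$ entries. This is only a sketch under that simplifying assumption (the lemma itself allows any fixed subspace), and the function name is ours.

```python
import math
import random


def projection_norm_on_coordinates(n, d, p, seed=0):
    """Norm of the projection of Y onto the span of the first d coordinates.

    Y has i.i.d. entries zeta = xi - p with xi ~ Bernoulli(p).
    """
    rng = random.Random(seed)
    y = [(1.0 if rng.random() < p else 0.0) - p for _ in range(n)]
    # Projecting onto a coordinate subspace keeps exactly those coordinates.
    return math.sqrt(sum(v * v for v in y[:d]))


if __name__ == "__main__":
    n, d, p = 4000, 2000, 0.3
    sigma = math.sqrt(p * (1.0 - p))
    print(projection_norm_on_coordinates(n, d, p), sigma * math.sqrt(d))
```

In repeated runs the two printed numbers differ by much less than $\sigma\sqrt{d}$ itself, which is the qualitative content of (3.2).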



The following concentration lemma for $G(n,p)$ will be a key input to prove Theorem 1.17. Let
$B_n = \frac{1}{\sqrt{n}\sigma}A_n$.

Lemma 3.5 (Concentration for ESD in the bulk). Assume
$p = g(n)\log n/n$. For any constants $\epsilon, \delta > 0$ and any interval $I$ in $[-2 + \epsilon, 2 - \epsilon]$ of width
$|I| = \Omega(\log^{2.2} g(n) \log n/np)$, the number of eigenvalues $N_I$ of $B_n$ in $I$ obeys the concentration
estimate

$$\left|N_I(B_n) - n\int_I \rho_{sc}(x)\,dx\right| \le \delta n|I|$$

with overwhelming probability.

The above lemma is a variant of Corollary 2.3. This lemma allows us to control the ESD on
a smaller interval and the proof, relying on a projection lemma (Lemma 3.4), is a different
approach. The proof is presented in Appendix C.
3.3 Proof of Theorem 1.16



Let $\lambda_n(A_n)$ be the largest eigenvalue of $A_n$ and $u = (u_1, \dots, u_n)$ be the corresponding unit
eigenvector. We have the lower bound $\lambda_n(A_n) \ge np$. And if $np = \omega(\log n)$, then the maximum
degree $\Delta = (1 + o(1))np$ almost surely (see Corollary 3.14, [1]).

For every $1 \le i \le n$,

$$\lambda_n(A_n)u_i = \sum_{j \in N(i)} u_j$$

where $N(i)$ is the neighborhood of vertex $i$. Thus, by the Cauchy-Schwarz inequality,

$$\|u\|_\infty = \max_i \frac{\left|\sum_{j \in N(i)} u_j\right|}{\lambda_n(A_n)} \le \frac{\sqrt{\Delta}}{\lambda_n(A_n)} = O\left(\frac{1}{\sqrt{np}}\right).$$



Let $B_n = \frac{1}{\sqrt{n}\sigma}A_n$. Since the eigenvalues of $W_n = \frac{1}{\sqrt{n}\sigma}(A_n - pJ_n)$ are on the interval $[-2,2]$, by
Lemma 1.1, $\{\lambda_1(B_n), \dots, \lambda_{n-1}(B_n)\} \subset [-2,2]$.

Recall that $np = g(n)\log n$. By Corollary 2.3, for any interval $I$ with length at least $\left(\frac{\log(np)}{\delta^4 (np)^{1/2}}\right)^{1/5}$ (say
$\delta = 0.5$), with overwhelming probability, if $I \subset [-2 + \kappa, 2 - \kappa]$ for some positive constant $\kappa$, one
has $N_I(B_n) = \Theta(n\int_I \rho_{sc}(x)\,dx) = \Theta(n|I|)$; if $I$ is at the edge of $[-2,2]$, with length $o(1)$, one
has $N_I(B_n) = \Theta(n\int_I \rho_{sc}(x)\,dx) = \Theta(n|I|^{3/2})$. Thus we can find a set $J \subset \{1, \dots, n-1\}$ with
$|J| = \Omega(n|I_0|)$ or $|J| = \Omega(n|I_0|^{3/2})$ such that $|\lambda_j(B_{n-1}) - \lambda_i(B_n)| \le |I_0|$ for all $j \in J$, where
$B_{n-1}$ is the bottom right $(n-1) \times (n-1)$ minor of $B_n$. Here we take $|I_0| = (1/g(n)^{1/20})^{2/3}$. It
is easy to check that $|I_0| \ge \left(\frac{\log(np)}{\delta^4 (np)^{1/2}}\right)^{1/5}$.



By the formula in Lemma 3.2, the entry of the eigenvector of $B_n$ can be expressed as

$$|x|^2 = \frac{1}{1 + \sum_{j=1}^{n-1}(\lambda_j(B_{n-1}) - \lambda_i(B_n))^{-2}\,|u_j(B_{n-1})^*\frac{1}{\sqrt{n}\sigma}X|^2}$$

$$\le \frac{1}{1 + \sum_{j \in J}(\lambda_j(B_{n-1}) - \lambda_i(B_n))^{-2}\,|u_j(B_{n-1})^*\frac{1}{\sqrt{n}\sigma}X|^2}$$

$$\le \frac{1}{1 + \sum_{j \in J} n^{-1}|I_0|^{-2}\,|u_j(B_{n-1})^*\frac{1}{\sigma}X|^2} = \frac{1}{1 + n^{-1}|I_0|^{-2}\,\|\pi_H(\frac{X}{\sigma})\|^2}$$

$$\le \frac{1}{1 + n^{-1}|I_0|^{-2}|J|} \qquad (3.3)$$

with overwhelming probability, where $H$ is the span of all the eigenvectors associated to $J$
with dimension $\dim(H) = \Theta(|J|)$, $\pi_H$ is the orthogonal projection onto $H$ and $X \in \mathbb{C}^{n-1}$ has
entries that are iid copies of $\xi$. The last inequality in (3.3) follows from Lemma 3.4 (by taking
$t = g(n)^{1/10}\sqrt{\log n}$) and the relations

$$\|\pi_H(X)\| = \|\pi_H(Y + p\mathbf{1}_n)\| \ge \|\pi_{H_1}(Y + p\mathbf{1}_n)\| = \|\pi_{H_1}(Y)\|.$$

Here $Y = X - p\mathbf{1}_n$ and $H_1 = H \cap H_2$, where $H_2$ is the space orthogonal to the all 1 vector $\mathbf{1}_n$.
For the dimension of $H_1$, $\dim(H_1) \ge \dim(H) - 1$.

Since either $|J| = \Omega(n|I_0|)$ or $|J| = \Omega(n|I_0|^{3/2})$, we have $n^{-1}|I_0|^{-2}|J| = \Omega(|I_0|^{-1})$ or $n^{-1}|I_0|^{-2}|J| =
\Omega(|I_0|^{-1/2})$. Thus $|x|^2 = O(|I_0|)$ or $|x|^2 = O(\sqrt{|I_0|})$. In both cases, since $|I_0| \to 0$, it follows
that $|x| = o(1)$. □



3.4 Proof of Theorem 1.17 



With the formula in Lemma 3.2, it suffices to show the following lower bound

$$\sum_{j=1}^{n-1}(\lambda_j(B_{n-1}) - \lambda_i(B_n))^{-2}\,\Big|u_j(B_{n-1})^*\frac{1}{\sqrt{n}\sigma}X\Big|^2 \gg \frac{np}{\log^{2.2} g(n)\log n} \qquad (3.4)$$

with overwhelming probability, where $B_{n-1}$ is the bottom right $(n-1) \times (n-1)$ minor of $B_n$ and
$X \in \mathbb{C}^{n-1}$ has entries that are iid copies of $\xi$. Recall that $\xi$ takes values 1 with probability $p$
and 0 with probability $1 - p$, thus $E\xi = p$, $\mathrm{Var}\,\xi = p(1-p) = \sigma^2$.



By Lemma 3.5, we can find a set $J \subset \{1, \dots, n-1\}$ with $|J| \gg \frac{\log^{2.2} g(n)\log n}{p}$ such that
$|\lambda_j(B_{n-1}) - \lambda_i(B_n)| = O(\log^{2.2} g(n)\log n/np)$ for all $j \in J$. Thus in (3.4), it is enough to prove

$$\sum_{j \in J} |u_j(B_{n-1})^* X|^2 \gg \sigma^2|J|$$

or equivalently

$$\|\pi_H(X)\|^2 \gg \sigma^2|J| \qquad (3.5)$$

with overwhelming probability, where $H$ is the span of all the eigenvectors associated to $J$ with
dimension $\dim(H) = \Theta(|J|)$.

Let $H_1 = H \cap H_2$, where $H_2$ is the space orthogonal to $\mathbf{1}_n$. The dimension of $H_1$ is at least
$\dim(H) - 1$. Denote $Y = X - p\mathbf{1}_n$. Then the entries of $Y$ are iid copies of $\zeta$. By Lemma 3.4,

$$\|\pi_{H_1}(Y)\|^2 \gg \sigma^2|J|$$

with overwhelming probability.

Hence, our claim follows from the relations

$$\|\pi_H(X)\| = \|\pi_H(Y + p\mathbf{1}_n)\| \ge \|\pi_{H_1}(Y + p\mathbf{1}_n)\| = \|\pi_{H_1}(Y)\|.$$



□ 



Appendices 



In this appendix, we complete the proofs of Theorem 1.3, Lemma 3.4 and Lemma 3.5 



A Proof of Theorem 1.3 



We will show that the semicircle law holds for $M_n$. With Lemma 1.1, it is clear that Theorem
1.3 follows from Lemma A.1 directly. The claim actually follows as a special case discussed in the
paper [6]. Our proof here uses a standard moment method.

Lemma A.1. For $p = \omega(\frac{1}{n})$, the empirical spectral distribution (ESD) of the matrix $W_n =
\frac{1}{\sqrt{n}}M_n$ converges in distribution to the semicircle law which has a density $\rho_{sc}(x)$ with support
on $[-2,2]$,

$$\rho_{sc}(x) := \frac{1}{2\pi}\sqrt{4 - x^2}.$$

Let $\eta_{ij}$ be the entries of $M_n = \sigma^{-1}(A_n - pJ_n)$. For $i = j$, $\eta_{ii} = -p/\sigma$; and for $i \ne j$, $\eta_{ij}$ are
iid copies of a random variable $\eta$, which takes value $(1-p)/\sigma$ with probability $p$ and takes value
$-p/\sigma$ with probability $1 - p$.

$$E\eta = 0, \quad E\eta^2 = 1, \quad E\eta^s = O\left(\frac{1}{(\sqrt{p})^{s-2}}\right) \text{ for } s \ge 2.$$



For a positive integer $k$, the $k$-th moment of the ESD of the matrix $W_n$ is

$$\int x^k\,dF_n^{W}(x) = \frac{1}{n}E(\mathrm{Trace}(W_n^k)),$$

and the $k$-th moment of the semicircle distribution is

$$\int_{-2}^{2} x^k\rho_{sc}(x)\,dx.$$

On a compact set, convergence in distribution is the same as convergence of moments. To
prove the theorem, we need to show, for every fixed number $k$,

$$\frac{1}{n}E(\mathrm{Trace}(W_n^k)) \to \int_{-2}^{2} x^k\rho_{sc}(x)\,dx, \quad \text{as } n \to \infty. \qquad (A.1)$$



For k = 2m + 1, by symmetry, j x p sc (x)dx = 0. 



2 

k 



For = 2m 



/2 1 Z" 2 2 fe+2 f^/ 2 
x k p sc (x)dx = — x k \fA — x 2 dx = / sin fc #cos 2 6dx 

■2 Wo 7T Jo 



2 *+2 r (fc±i)r(|) i /2m 



7T T(^) m + l\ m 
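The identification of the even moments with the Catalan numbers can be confirmed numerically. The sketch below evaluates the Gamma-function expression for the $2m$th moment and compares it with $\frac{1}{m+1}\binom{2m}{m}$; the function names are illustrative only.

```python
import math


def semicircle_even_moment(m: int) -> float:
    """2m-th moment of the semicircle law via the Beta integral
    int_0^{pi/2} sin^k(t) cos^2(t) dt = Gamma((k+1)/2) Gamma(3/2) / (2 Gamma((k+4)/2))."""
    k = 2 * m
    trig_integral = (
        math.gamma((k + 1) / 2) * math.gamma(1.5) / (2 * math.gamma((k + 4) / 2))
    )
    return 2 ** (k + 2) / math.pi * trig_integral


def catalan(m: int) -> int:
    """m-th Catalan number, binom(2m, m) / (m + 1)."""
    return math.comb(2 * m, m) // (m + 1)


if __name__ == "__main__":
    for m in range(6):
        print(m, semicircle_even_moment(m), catalan(m))
```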



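Thus our claim (A.1), the convergence of moments, follows by showing that
\[ \frac{1}{n}\mathbf{E}\big(\mathrm{Trace}(W_n^k)\big) = \begin{cases} O\big(\frac{1}{\sqrt{np}}\big) & \text{if } k = 2m+1; \\ \frac{1}{m+1}\binom{2m}{m} + O\big(\frac{1}{np}\big) & \text{if } k = 2m. \end{cases} \tag{A.2} \]
We have the expansion for the trace of $W_n$,
\[ \frac{1}{n}\mathbf{E}\big(\mathrm{Trace}(W_n^k)\big) = \frac{1}{n^{1+k/2}}\mathbf{E}\big(\mathrm{Trace}(M_n^k)\big) = \frac{1}{n^{1+k/2}}\sum_{1\le i_1,\dots,i_k\le n}\mathbf{E}\,\eta_{i_1 i_2}\eta_{i_2 i_3}\cdots\eta_{i_k i_1}. \tag{A.3} \]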



Each term in the above sum corresponds to a closed walk of length $k$ on the complete graph $K_n$ on $\{1, 2, \dots, n\}$. On the other hand, the $\eta_{ij}$ are independent with mean 0. Thus a term is nonzero only if every edge in the closed walk appears at least twice; we call such a walk a good walk. Consider a good walk that uses $l$ different edges $e_1, \dots, e_l$ with corresponding multiplicities $m_1, \dots, m_l$, where $l \le m$, each $m_h \ge 2$ and $m_1 + \dots + m_l = k$. The term corresponding to this good walk has the form
\[ \mathbf{E}\,\eta_{e_1}^{m_1}\cdots\eta_{e_l}^{m_l}. \]
Since such a walk uses at most $l+1$ vertices, a naive upper bound for the number of good walks of this type is $n^{l+1} \times l^k$.

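When $k = 2m+1$, recall $\mathbf{E}\eta^s = \Theta\big((\sqrt{p})^{2-s}\big)$ for $s \ge 2$, and so
\[ \frac{1}{n}\mathbf{E}\big(\mathrm{Trace}(W_n^k)\big) = \frac{1}{n^{1+k/2}}\sum_{l=1}^{m}\ \sum_{\text{good walk of } l \text{ edges}} \mathbf{E}\,\eta_{e_1}^{m_1}\cdots\eta_{e_l}^{m_l} \le \frac{1}{n^{m+3/2}}\sum_{l=1}^{m} n^{l+1} l^k \Big(\frac{1}{\sqrt{p}}\Big)^{m_1-2}\cdots\Big(\frac{1}{\sqrt{p}}\Big)^{m_l-2} = O\Big(\frac{1}{\sqrt{np}}\Big). \]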



When $k = 2m$, we classify the good walks into two types. The first kind uses $l \le m-1$ different edges. The contribution of these terms is
\[ \frac{1}{n^{1+k/2}}\sum_{l=1}^{m-1}\ \sum_{\text{1st kind of good walk of } l \text{ edges}} \mathbf{E}\,\eta_{e_1}^{m_1}\cdots\eta_{e_l}^{m_l} \le \frac{1}{n^{1+m}}\sum_{l=1}^{m-1} n^{l+1} l^k \Big(\frac{1}{\sqrt{p}}\Big)^{m_1-2}\cdots\Big(\frac{1}{\sqrt{p}}\Big)^{m_l-2} = O\Big(\frac{1}{np}\Big). \]

The second kind of good walk uses exactly $l = m$ different edges and thus $m+1$ different vertices. The corresponding term for each such walk has the form
\[ \mathbf{E}\,\eta_{e_1}^{2}\cdots\eta_{e_m}^{2} = 1. \]

The number of good walks of this kind is given by the following result from [1, pp. 617-618]:

Lemma A.2. The number of good walks of the second kind is
\[ \frac{n^{m+1}(1 + O(n^{-1}))}{m+1}\binom{2m}{m}. \]
Then the second conclusion of (A.2) follows.
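For small parameters, the count of good walks of the second kind can be verified by brute force. The sketch below is illustrative only: it enumerates all closed walks of length $2m$ on $K_n$, keeps those that use exactly $m$ distinct edges, each exactly twice, on $m+1$ distinct vertices, and compares the count with $n(n-1)\cdots(n-m)\cdot\frac{1}{m+1}\binom{2m}{m}$, which has the form $n^{m+1}(1+O(n^{-1}))\frac{1}{m+1}\binom{2m}{m}$. The function names are ours, and the exact product formula is the standard tree-counting refinement, stated here as an assumption rather than taken from the paper.

```python
from itertools import product
from math import comb


def count_second_kind_walks(n: int, m: int) -> int:
    """Count closed walks (i_1, ..., i_{2m}) on K_n that traverse exactly m
    distinct edges, each exactly twice, and visit exactly m + 1 vertices."""
    k = 2 * m
    count = 0
    for walk in product(range(n), repeat=k):
        multiplicity = {}
        valid = True
        for t in range(k):
            u, v = walk[t], walk[(t + 1) % k]
            if u == v:  # K_n has no loops
                valid = False
                break
            edge = (min(u, v), max(u, v))
            multiplicity[edge] = multiplicity.get(edge, 0) + 1
        if (
            valid
            and len(multiplicity) == m
            and all(c == 2 for c in multiplicity.values())
            and len(set(walk)) == m + 1
        ):
            count += 1
    return count


def predicted_count(n: int, m: int) -> int:
    """n (n-1) ... (n-m) times the m-th Catalan number."""
    falling = 1
    for i in range(m + 1):
        falling *= n - i
    return falling * comb(2 * m, m) // (m + 1)


if __name__ == "__main__":
    for n, m in [(4, 1), (5, 2), (5, 3)]:
        print(n, m, count_second_kind_walks(n, m), predicted_count(n, m))
```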



B Proof of Lemma 3.4



The coordinates of $Y$ are bounded in magnitude by 1. Apply Talagrand's inequality to the map $Y \to \|\pi_H(Y)\|$, which is convex and 1-Lipschitz. We can conclude
\[ \mathbf{P}\big(\big|\,\|\pi_H(Y)\| - M(\|\pi_H(Y)\|)\,\big| \ge t\big) \le 4\exp\Big(-\frac{t^2}{16}\Big) \tag{B.1} \]
where $M(\|\pi_H(Y)\|)$ is the median of $\|\pi_H(Y)\|$.

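Let $P = (p_{ij})_{1\le i,j\le n}$ be the orthogonal projection matrix onto $H$, and write $\zeta_1, \dots, \zeta_n$ for the coordinates of $Y$. One has $\mathrm{trace}\,P^2 = \mathrm{trace}\,P = \sum_i p_{ii} = d$ and $|p_{ii}| \le 1$, as well as
\[ \|\pi_H(Y)\|^2 = \sum_{1\le i,j\le n} p_{ij}\zeta_i\zeta_j = \sum_{i=1}^n p_{ii}\zeta_i^2 + \sum_{i\ne j} p_{ij}\zeta_i\zeta_j \]
and
\[ \mathbf{E}\|\pi_H(Y)\|^2 = \mathbf{E}\Big(\sum_{i=1}^n p_{ii}\zeta_i^2\Big) + \mathbf{E}\Big(\sum_{i\ne j} p_{ij}\zeta_i\zeta_j\Big) = \sigma^2 d. \]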
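The identity $\mathbf{E}\|\pi_H(Y)\|^2 = \sigma^2 d$ can be checked exactly on a small example by enumerating all outcomes. The sketch below is a hypothetical illustration: it takes $H$ to be the hyperplane orthogonal to an arbitrary vector $u$ (so $d = n-1$) and assumes the coordinates of $Y$ equal $1-p$ with probability $p$ and $-p$ with probability $1-p$, so that $\sigma^2 = p(1-p)$. The helper names are ours.

```python
from itertools import product


def complement_projection(u):
    """Orthogonal projection matrix onto the hyperplane orthogonal to u."""
    n = len(u)
    norm2 = sum(x * x for x in u)
    return [
        [(1.0 if i == j else 0.0) - u[i] * u[j] / norm2 for j in range(n)]
        for i in range(n)
    ]


def expected_sq_projection(P, p):
    """Exact E||P Y||^2 over all 2^n outcomes of the centered Bernoulli vector Y."""
    n = len(P)
    total = 0.0
    for bits in product((0, 1), repeat=n):
        prob = 1.0
        y = []
        for b in bits:
            prob *= p if b else 1 - p
            y.append(1 - p if b else -p)
        # For an orthogonal projection, ||P y||^2 = y^T P y.
        quad = sum(P[i][j] * y[i] * y[j] for i in range(n) for j in range(n))
        total += prob * quad
    return total


if __name__ == "__main__":
    P = complement_projection([1.0, 2.0, -1.0, 3.0])
    p = 0.2
    d = sum(P[i][i] for i in range(len(P)))  # trace P = dim H
    print(expected_sq_projection(P, p), p * (1 - p) * d)
```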

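Take $L = 4/\sigma$. To complete the proof, it suffices to show
\[ \big| M(\|\pi_H(Y)\|) - \sigma\sqrt{d} \big| \le L\sigma. \tag{B.2} \]
Consider the event $\mathcal{E}_+$ that $\|\pi_H(Y)\| \ge \sigma L + \sigma\sqrt{d}$, which implies that $\|\pi_H(Y)\|^2 \ge \sigma^2(L^2 + 2L\sqrt{d} + d)$.

Let $S_1 = \sum_{i=1}^n p_{ii}(\zeta_i^2 - \sigma^2)$ and $S_2 = \sum_{i\ne j} p_{ij}\zeta_i\zeta_j$. Now we have
\[ \mathbf{P}(\mathcal{E}_+) \le \mathbf{P}\Big(\sum_{i=1}^n p_{ii}\zeta_i^2 \ge \sigma^2 d + L\sqrt{d}\sigma^2\Big) + \mathbf{P}\Big(\sum_{i\ne j} p_{ij}\zeta_i\zeta_j \ge \sigma^2 L\sqrt{d}\Big). \]
By Chebyshev's inequality,
\[ \mathbf{P}\Big(\sum_{i=1}^n p_{ii}\zeta_i^2 \ge \sigma^2 d + L\sqrt{d}\sigma^2\Big) = \mathbf{P}(S_1 \ge L\sqrt{d}\sigma^2) \le \frac{\mathbf{E}(|S_1|^2)}{L^2 d\sigma^4}, \]
where $\mathbf{E}(|S_1|^2) = \mathbf{E}\big(\sum_i p_{ii}(\zeta_i^2 - \sigma^2)\big)^2 = \sum_i p_{ii}^2\,\mathbf{E}(\zeta_i^2 - \sigma^2)^2 \le d\sigma^2(1 - 2\sigma^2)$.

Therefore, $\mathbf{P}(S_1 \ge L\sqrt{d}\sigma^2) \le \frac{d\sigma^2(1-2\sigma^2)}{L^2 d\sigma^4} < \frac{1}{16}$.

On the other hand, since $\sum_{i\ne j} p_{ij}^2 \le \mathrm{trace}\,P^2 = d$, we have $\mathbf{E}(|S_2|^2) = 2\sigma^4\sum_{i\ne j} p_{ij}^2 \le 2\sigma^4 d$ and
\[ \mathbf{P}\Big(\sum_{i\ne j} p_{ij}\zeta_i\zeta_j \ge \sigma^2 L\sqrt{d}\Big) = \mathbf{P}(S_2 \ge L\sqrt{d}\sigma^2) \le \frac{2\sigma^4 d}{L^2 d\sigma^4} = \frac{2}{L^2} < \frac{1}{8}. \]
It follows that $\mathbf{P}(\mathcal{E}_+) < 1/4$ and hence $M(\|\pi_H(Y)\|) \le L\sigma + \sqrt{d}\sigma$.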
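The two second-moment bounds fed into Chebyshev's inequality can be verified exactly on a toy projection. The following sketch is a hypothetical check, not part of the original argument: it enumerates all outcomes of a centered Bernoulli vector, computes $\mathbf{E}(S_1^2)$ and $\mathbf{E}(S_2^2)$, and compares them with $d\sigma^2(1-2\sigma^2)$ and $2\sigma^4 d$. The projection onto the complement of a vector $u$ and all function names are our own choices.

```python
from itertools import product


def complement_projection(u):
    """Orthogonal projection matrix onto the hyperplane orthogonal to u."""
    n = len(u)
    norm2 = sum(x * x for x in u)
    return [
        [(1.0 if i == j else 0.0) - u[i] * u[j] / norm2 for j in range(n)]
        for i in range(n)
    ]


def second_moments(P, p):
    """Exact (E S_1^2, E S_2^2) with S_1 = sum_i p_ii (z_i^2 - sigma^2) and
    S_2 = sum_{i != j} p_ij z_i z_j, where z_i = 1-p w.p. p and -p w.p. 1-p."""
    n = len(P)
    sigma2 = p * (1 - p)
    e1 = e2 = 0.0
    for bits in product((0, 1), repeat=n):
        prob = 1.0
        z = []
        for b in bits:
            prob *= p if b else 1 - p
            z.append(1 - p if b else -p)
        s1 = sum(P[i][i] * (z[i] ** 2 - sigma2) for i in range(n))
        s2 = sum(
            P[i][j] * z[i] * z[j] for i in range(n) for j in range(n) if i != j
        )
        e1 += prob * s1 * s1
        e2 += prob * s2 * s2
    return e1, e2


if __name__ == "__main__":
    P = complement_projection([1.0, 1.0, 2.0])
    p = 0.3
    sigma2 = p * (1 - p)
    d = sum(P[i][i] for i in range(len(P)))
    e1, e2 = second_moments(P, p)
    print(e1, d * sigma2 * (1 - 2 * sigma2))
    print(e2, 2 * sigma2 ** 2 * d)
```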

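For the lower bound, consider the event $\mathcal{E}_-$ that $\|\pi_H(Y)\| \le \sqrt{d}\sigma - L\sigma$ and notice that
\[ \mathbf{P}(\mathcal{E}_-) \le \mathbf{P}(S_1 \le -L\sqrt{d}\sigma^2) + \mathbf{P}(S_2 \le -L\sqrt{d}\sigma^2). \]
The same argument applies to get $M(\|\pi_H(Y)\|) \ge \sqrt{d}\sigma - L\sigma$. Now the relations (B.1) and (B.2) together imply (3.2).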



C Proof of Lemma 3.5



Recall the normalized adjacency matrix
\[ M_n = \frac{1}{\sigma}(A_n - pJ_n), \]
where $J_n = \mathbf{1}_n\mathbf{1}_n^T$ is the $n \times n$ matrix of all 1's, and let $W_n = \frac{1}{\sqrt{n}}M_n$.

Lemma C.1. For all intervals $I \subset \mathbb{R}$ with $|I| = \omega(\log n)/np$, one has
\[ N_I(W_n) = O(n|I|) \]
with overwhelming probability.




The proof of Lemma C.1 follows the same argument as in [30], using the relation (3.2).

Actually we will prove the following concentration theorem for $M_n$. By Lemma 1.1, $|N_I(W_n) - N_I(B_n)| \le 1$; therefore Lemma C.2 implies Lemma 3.5.



Lemma C.2 (Concentration for ESD in the bulk). Assume $p = g(n)\log n/n$. For any constants $\varepsilon, \delta > 0$ and any interval $I$ in $[-2+\varepsilon, 2-\varepsilon]$ of width $|I| = \Omega(g(n)^{0.6}\log n/np)$, the number of eigenvalues $N_I$ of $W_n = \frac{1}{\sqrt{n}}M_n$ in $I$ obeys the concentration estimate
\[ \Big| N_I(W_n) - n\int_I \rho_{sc}(x)\,dx \Big| \le \delta n|I| \]
with overwhelming probability.



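To prove Lemma C.2, following the proof in [20], we consider the Stieltjes transform
\[ s_n(z) := \frac{1}{n}\sum_{i=1}^n \frac{1}{\lambda_i(W_n) - z}, \]
whose imaginary part
\[ \mathrm{Im}\, s_n(x + \sqrt{-1}\eta) = \frac{1}{n}\sum_{i=1}^n \frac{\eta}{\eta^2 + (\lambda_i(W_n) - x)^2} > 0 \]
in the upper half-plane $\eta > 0$.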



The semicircle counterpart
\[ s(z) := \int_{-2}^{2}\frac{1}{x - z}\rho_{sc}(x)\,dx = \frac{1}{2\pi}\int_{-2}^{2}\frac{1}{x-z}\sqrt{4 - x^2}\,dx \]
is the unique solution to the equation
\[ s(z) + \frac{1}{s(z) + z} = 0 \]
with $\mathrm{Im}\,s(z) > 0$.
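The characterization of $s(z)$ can be sanity-checked numerically. The sketch below, with function names of our own choosing, solves the quadratic $s^2 + zs + 1 = 0$ (equivalent to the displayed equation), selects the root with positive imaginary part, and compares it with a midpoint-rule evaluation of the defining integral after the substitution $x = 2\sin\theta$.

```python
import cmath
import math


def semicircle_stieltjes(z: complex) -> complex:
    """Root of s^2 + z s + 1 = 0 with positive imaginary part (for Im z > 0)."""
    root = cmath.sqrt(z * z - 4)
    s = (-z + root) / 2
    return s if s.imag > 0 else (-z - root) / 2


def semicircle_stieltjes_numeric(z: complex, steps: int = 4000) -> complex:
    """Midpoint rule for int rho_sc(x) / (x - z) dx with x = 2 sin(theta),
    so that rho_sc(x) dx = (2 / pi) cos^2(theta) d(theta)."""
    h = math.pi / steps
    total = 0j
    for i in range(steps):
        theta = -math.pi / 2 + (i + 0.5) * h
        total += (2 / math.pi) * math.cos(theta) ** 2 / (2 * math.sin(theta) - z)
    return total * h


if __name__ == "__main__":
    z = 0.5 + 0.3j
    s = semicircle_stieltjes(z)
    print(s, semicircle_stieltjes_numeric(z), abs(s + 1 / (s + z)))
```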

The next proposition gives control of the ESD through control of the Stieltjes transform (we will take $L = 2$ in the proof):

Proposition C.3 (Lemma 60, [30]). Let $L, \varepsilon, \delta > 0$. Suppose that one has the bound
\[ |s_n(z) - s(z)| \le \delta \]
with (uniformly) overwhelming probability for all $z$ with $|\mathrm{Re}(z)| \le L$ and $\mathrm{Im}(z) \ge \eta$. Then for any interval $I$ in $[-L+\varepsilon, L-\varepsilon]$ with $|I| \ge \max\big(2\eta, \frac{\eta}{\delta}\log\frac{1}{\delta}\big)$, one has
\[ \Big| N_I - n\int_I \rho_{sc}(x)\,dx \Big| \le \delta n|I| \]
with overwhelming probability.



By Proposition C.3, our objective is to show
\[ |s_n(z) - s(z)| \le \delta \tag{C.1} \]
with (uniformly) overwhelming probability for all $z$ with $|\mathrm{Re}(z)| \le 2$ and $\mathrm{Im}(z) \ge \eta$, where
\[ \eta = \frac{\log^2 g(n)\,\log n}{np}. \]



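In Lemma 3.3 we write
\[ s_n(z) = \frac{1}{n}\sum_{k=1}^n \frac{1}{-\frac{p}{\sigma\sqrt{n}} - z - Y_k} \tag{C.2} \]
where $-\frac{p}{\sigma\sqrt{n}}$ is the diagonal entry of $W_n$,
\[ Y_k = a_k^*(W_{n,k} - zI)^{-1}a_k, \]
$W_{n,k}$ is the matrix $W_n$ with the $k$th row and column removed, and $a_k$ is the $k$th row of $W_n$ with the $k$th element removed.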
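The decomposition behind (C.2) is the Schur complement formula for a diagonal entry of the resolvent: $\big((W - zI)^{-1}\big)_{kk} = \big(w_{kk} - z - a_k^*(W_{n,k} - zI)^{-1}a_k\big)^{-1}$. As a rough illustration, the sketch below checks this identity on an arbitrary small real symmetric matrix using a plain Gaussian elimination solver; the matrix, the helper names, and the solver are all our own choices for the example.

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting (complex)."""
    n = len(A)
    M = [[complex(v) for v in A[i]] + [complex(b[i])] for i in range(n)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[pivot] = M[pivot], M[c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= factor * M[c][j]
    x = [0j] * n
    for i in range(n - 1, -1, -1):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def resolvent_diag(W, z, k):
    """The (k, k) entry of (W - zI)^{-1}, computed directly."""
    n = len(W)
    A = [[W[i][j] - (z if i == j else 0) for j in range(n)] for i in range(n)]
    e_k = [1 if i == k else 0 for i in range(n)]
    return solve(A, e_k)[k]


def schur_diag(W, z, k):
    """The same entry via 1 / (w_kk - z - Y_k), Y_k = a_k^T (W_k - zI)^{-1} a_k."""
    idx = [i for i in range(len(W)) if i != k]
    a_k = [W[k][i] for i in idx]  # k-th row with the k-th element removed
    minor = [[W[i][j] - (z if i == j else 0) for j in idx] for i in idx]
    y = solve(minor, a_k)
    Y_k = sum(a * v for a, v in zip(a_k, y))  # W is real, so a_k^* = a_k^T
    return 1 / (W[k][k] - z - Y_k)


if __name__ == "__main__":
    W = [
        [-0.1, 0.4, -0.2, 0.3],
        [0.4, -0.1, 0.5, -0.2],
        [-0.2, 0.5, -0.1, 0.1],
        [0.3, -0.2, 0.1, -0.1],
    ]
    z = 0.3 + 0.7j
    for k in range(4):
        print(k, resolvent_diag(W, z, k), schur_diag(W, z, k))
```

Averaging the diagonal resolvent entries over $k$ recovers $s_n(z)$, which is how the sum over $k$ in (C.2) arises.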

The entries of $a_k$ are independent of each other and of $W_{n,k}$, and have mean zero and variance $1/n$. By linearity of expectation we have
\[ \mathbf{E}(Y_k \mid W_{n,k}) = \frac{1}{n}\mathrm{Trace}(W_{n,k} - zI)^{-1} = \Big(1 - \frac{1}{n}\Big)s_{n,k}(z) \]
where
\[ s_{n,k}(z) = \frac{1}{n-1}\sum_{i=1}^{n-1}\frac{1}{\lambda_i(W_{n,k}) - z} \]
is the Stieltjes transform of $W_{n,k}$. From the Cauchy interlacing law, we get
\[ \Big| s_n(z) - \Big(1 - \frac{1}{n}\Big)s_{n,k}(z) \Big| = O\Big(\frac{1}{n}\int_{\mathbb{R}}\frac{1}{|x - z|^2}\,dx\Big) = O\Big(\frac{1}{n\eta}\Big) = o(1), \]
and thus
\[ \mathbf{E}(Y_k \mid W_{n,k}) = s_n(z) + o(1). \]
In fact a similar estimate holds for $Y_k$ itself:

Proposition C.4. For $1 \le k \le n$, $Y_k = \mathbf{E}(Y_k \mid W_{n,k}) + o(1)$ holds with (uniformly) overwhelming probability for all $z$ with $|\mathrm{Re}(z)| \le 2$ and $\mathrm{Im}(z) \ge \eta$.



Assume this proposition for the moment. By hypothesis, the diagonal entries $-\frac{p}{\sigma\sqrt{n}}$ of $W_n$ are $o(1)$. Thus in (C.2) we actually get
\[ s_n(z) + \frac{1}{n}\sum_{k=1}^n \frac{1}{s_n(z) + z + o(1)} = 0 \tag{C.3} \]
with overwhelming probability. This implies that with overwhelming probability either $s_n(z) = s(z) + o(1)$ or $s_n(z) = -z + o(1)$. On the other hand, as $\mathrm{Im}\,s_n(z)$ is necessarily positive, the second possibility can only occur when $\mathrm{Im}\,z = o(1)$. A continuity argument (as in [11]) then shows that the second possibility cannot occur at all and the claim follows.



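Now it remains to prove Proposition C.4.

Proof of Proposition C.4. Decompose
\[ Y_k = \sum_{j=1}^{n-1}\frac{|u_j(W_{n,k})^* a_k|^2}{\lambda_j(W_{n,k}) - z} \]
and evaluate
\[ Y_k - \mathbf{E}(Y_k \mid W_{n,k}) = Y_k - \Big(1 - \frac{1}{n}\Big)s_{n,k}(z) + o(1) = \sum_{j=1}^{n-1}\frac{|u_j(W_{n,k})^* a_k|^2 - \frac{1}{n}}{\lambda_j(W_{n,k}) - z} + o(1) = \sum_{j=1}^{n-1}\frac{R_j}{\lambda_j(W_{n,k}) - z} + o(1), \tag{C.4} \]
where we denote $R_j = |u_j(W_{n,k})^* a_k|^2 - \frac{1}{n}$ and $\{u_j(W_{n,k})\}$ are orthonormal eigenvectors of $W_{n,k}$.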



Let $J \subset \{1, \dots, n-1\}$; then
\[ \sum_{j\in J} R_j = \|P_H(a_k)\|^2 - \frac{\dim(H)}{n}, \]
where $H$ is the space spanned by $\{u_j(W_{n,k})\}$ for $j \in J$ and $P_H$ is the orthogonal projection onto $H$.



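In Lemma 3.4, by taking $t = h(n)\sqrt{\log n}$, where $h(n) = \log^{0.001} g(n)$, one can conclude with overwhelming probability
\[ \Big|\sum_{j\in J} R_j\Big| \ll \frac{1}{n}\Big(\frac{h(n)\sqrt{|J|\log n}}{\sqrt{p}} + \frac{h(n)^2\log n}{p}\Big). \tag{C.5} \]
Using the triangle inequality,
\[ \sum_{j\in J} |R_j| \ll \frac{1}{n}\Big(|J| + \frac{h(n)^2\log n}{p}\Big) \tag{C.6} \]
with overwhelming probability.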



Let $z = x + \sqrt{-1}\eta$, where $\eta = \log^2 g(n)\log n/np$ and $|x| \le 2 - \varepsilon$, and define two parameters
\[ \alpha = \frac{1}{\log^{4/3} g(n)} \quad\text{and}\quad \beta = \frac{1}{\log^{1/3} g(n)}. \]



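First, for those $j \in J$ such that $|\lambda_j(W_{n,k}) - x| \le \beta\eta$, the function $\frac{1}{\lambda_j(W_{n,k}) - x - \sqrt{-1}\eta}$ has magnitude $O(\frac{1}{\eta})$. From Lemma C.1, $|J| \ll n\beta\eta$, and so the contribution for these $j \in J$ is
\[ \Big|\sum_{j\in J}\frac{R_j}{\lambda_j(W_{n,k}) - z}\Big| \ll \frac{1}{n\eta}\Big(n\beta\eta + \frac{h(n)^2\log n}{p}\Big) = O\Big(\frac{1}{\log^{1/3} g(n)}\Big) + O\Big(\frac{h(n)^2}{\log^2 g(n)}\Big) = o(1). \]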



For the contribution of the remaining $j \in J$, we subdivide the indices as
\[ a \le |\lambda_j(W_{n,k}) - x| \le (1+\alpha)a \]
where $a = (1+\alpha)^l\beta\eta$, for $0 \le l \le L$, and then sum over $l$.

For each such interval, the function $\frac{1}{\lambda_j(W_{n,k}) - x - \sqrt{-1}\eta}$ has magnitude $O(\frac{1}{a})$ and fluctuates by at most $O(\frac{\alpha}{a})$. Say $J$ is the set of all $j$'s in this interval; thus by Lemma C.1, $|J| = O(n\alpha a)$.



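Together with bounds (C.5), (C.6), the contribution for these $j$ on such an interval is
\[ \Big|\sum_{j\in J}\frac{R_j}{\lambda_j(W_{n,k}) - z}\Big| \ll \frac{1}{an}\Big(\frac{h(n)\sqrt{|J|\log n}}{\sqrt{p}} + \frac{h(n)^2\log n}{p}\Big) + \frac{\alpha}{an}\Big(|J| + \frac{h(n)^2\log n}{p}\Big) = O\Big(\frac{\sqrt{\alpha}}{\sqrt{(1+\alpha)^l\beta}}\cdot\frac{h(n)}{\log g(n)} + \frac{h(n)^2}{(1+\alpha)^l\beta\log^2 g(n)} + \alpha^2\Big). \]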



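Summing over $l$ and noticing that $(1+\alpha)^L\eta/g(n)^{1/4} \le 3$, we get
\[ \Big|\sum_{j:\ |\lambda_j(W_{n,k}) - x| > \beta\eta}\frac{R_j}{\lambda_j(W_{n,k}) - z}\Big| = O\Big(\frac{1}{\sqrt{\alpha\beta}}\cdot\frac{h(n)}{\log g(n)} + \alpha\log\frac{1}{\beta\eta}\Big) = O\Big(\frac{h(n)}{\log^{1/6} g(n)}\Big) = o(1). \]

□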